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a representative cDNA sequence for KIAA0866." 

/ codon_s t art = 1 

/product="KIAA0866 protein" 

/protein_id="BAA74889 . 2" 

/db_xref="GI : 27529744" 

/translat ion= " SPQLEIWDQQGTMAQKGQLSDDEKFLFVDKNFINSPVAQADWAA 
KRLVWVPSEKQGFEAASIKEEKGDEVWELVENGKKVTVGKDDIQKMNPPKFSKVEDM 
AELTCLNEASVLHNLRERYFSGLIYTYSGLFCVWNPYKHLPIYSEKIVDMYKGKKRH 
EMPPHIYAIADTAYRSMLQDREDQSILCTGESGAGKTENTKKVIQYLAWASSHKGKK 
DTSITGELEKQLLQANPILEAFGNAKTVKNDNSSRFGKFIRINFDVTGYIVGANIETY 
LLEKSRAIRQARDERTFHI FYYMIAGAKEKMRSDLLLEGFNNYTFLSNGFVPI PAAQD 
DEMFQETVEAMAIMGFSEEEQLSILKWSSVLQLGNIVFKKERNTDQASMPDNTAAQK 
VCHLMGI NVTDFTRS I LTPR I KVGRD WQKAQTKEQADFAVEALAKAT YERLFRWI LT 
RVNKALDKTHRQGASFLGI LDIAGFE I FEVNS FEQLCINYTNEKLQQLFNHTMFI LEQ 
EEYQREGIEWNFIDFGLDLQPCIELIERPNNPPGVLALLDEECWFPKATDKSFVEKLC 
TEQGSHPKFQKPKQLKDKTEFS I IHYAGKVDYNASAWLTKNMDPLNDNVTSLLNAS SD 
KFVADLWKDVDRIVGLDQMAKMTESSLPSASKTKKGMFRTVGQLYKEQLGKLMTTLRN 
TTPNFVRCI I PNHEKRSGKLDAFLVLEQLRCNGVLEGIRICRQGFPNRIVFQEFRQRY 
E I LAANAI PKGFMDGKQACILMI KALELDPNLYRIGQSKI FFRTGVLAHLEEERDLKI 
TD V I MAFQ AMCRG YLARKAFAKRQQQLTAMKV I QRNCAA YLKLRNWQWWRLFT KVK P L 
LQVTRQEEEMQAKEDELQKTKERQQKAENELKELEQKHSQLTEEKNLLQEQLQAETEL 
YAEAEEMRVRLAAKKQELEEILHEMEARLEEEEDRGQQLQAERKKMAQQMLDLEEQLE 
EEEAARQKLQLEKVTAEAKI KKLEDEI LVMDDQNNKLSKERKLLEERI SDLTTNLAEE 
EEKAKNLTKLKNKHESMISELEVRLKKEEKSRQELEKLKRKLEGDASDFHEQIADLQA 
QIAELKMQLAKKEEELQAALARLDDEIAQKNNALKKIRELEGHISDLQEDLDSERAAR 
NKAEKQKRDLGEELEALKTELEDTLDSTATQQELRAKREQEVTVLKKALDEETRSHEA 
QVQEMRQKHAQAVEELTEQLEQFKRAKT^NLDKNKQTLEKENADLAGELRVLGQAKQEV 
EHKKKKLEAQVQELQSKCSDGERARAELNDKVHKLQNEVESVTGMLNEAEGKAIKLAK 
DVASLSSQLQDTQELLQEETRQKLNVSTKLRQLEEERNSLQDQLDEEMEAKQNLERHI 
STLNIQLSDS KKKLQDFAS T VE ALE EG KKR FQKE I ENLTQQ YEE KAAAYD KLE KTKNR 
LQQELDDLWDLDNQRQLVSNLEKKQRKFDQLLAEEKNISSKYADERDRAEAEAREKE 
TKALS LARALEEALEAKEELERTNKMLKAEMEDLVS S KDDVGKNVHELEKS KRALETQ 
MEEMKTQLEELEDELQATEDAKLRLEVNMQALKGQFERDLQARDEQNEEKRRQLQRQL 
HE YETELEDERKQRALAAAAKKKLEGDLKDLELQADSAI KGREEAI KQLRKLQAQMKD 



FQRELEDARASRDEI FATAKENEKKAKSLEADLMQLQEDLAAAERARKQADLEKEELA 
EELASSLSGRNALQDEKRRLEARIAQLEEELEEEQGNMEAMSDRVRKATQQAEQLSNE 
LATERSTAQKNESARQQLERQNKELRSKLHEMEGAVKSKFKSTIAALEAKIAQLEEQV 
EQEAREKQAATKSLKQKDKKLKEILLQVEDERKMAEQYKEQAEKGNARVKQLKRQLEE 
AEEESQRINANRRKLQRELDEATESNEAMGREVNALKSKLRRGNETSFVPSRRSGGRR 
VIENADGSEEETDTRDADFNGTKASE » 



ORIGIN 



Alignment Scores: 

Pred. No.:. 0 Length: 6846 

Score: 9602.50 Matches: 1929 

Percent Similarity: 99.64% Conservative: 0 

Best Local Similarity: 99.64% Mismatches: 0 

Query Match: 98.93% Indels: 7 

DB: 9 Gaps: 1 

-2 (1-1945) X AB020673 (1-6846) 

MetAlaGlnLysGlyGlnLeuSerAspAspGluLysPheLeuPheValAspLysAsnPhe 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

ATGGCGCAGAAGGGCCAACTCAGTGACGATGAGAAGTTCCTCTTTGTGGACAAAAACTTC 
IleAsnSerProValAlaGlrxAlaAspTrpAlaAlaLysArgLeuValTrpValProSer 

lllllllllllll II Mllllllll llllll II HUM 1111111111111111 Mill 

ATCAACAGCCCAGTGGCCCAGGCTGACTGGGCCGCCAAGAGACTCGTCTGGGTCCCCTCG 
GluLysGlnGlyPheGluAlaAlaSerlleLysGluGluLysGlyAspGluVal Valval 

1111111111111 II II II llllilllllllll llllll II Ml II I lllllllllllll 

GAGAAGCAGGGCTTCGAGGCAGCCAGCATTAAGGAGGAGAAGGGGGATGAGGTGGTTGTG 

GluLeuValGluAsnGlyLysLysValThrValGlyLysAspAspIleGlnLysMetAsn 
I I I I I I I I I I i II tl i ! I I li II I I II II :l I I I I II I 1 I I I I M I I I I I I I I I I I I I I I 

GAGCTGGTGGAGAATGGCAAGAAGGTCACGGTTGGGAAAGATGACATCCAGAAGATGAAC 
ProProLysPheSerLysValGluAspMetAlaGluLeuThrCysLeuAsnGluAlaSer 

I I I I I I I I I I ! h II I I I I II II I I II il !| I I I I II I I I I I I :l I I I I I I I I I I I I I I I 

CCACCCAAGTTCTCCAAGGTGGAGGACATGGCGGAGCTGACGTGCCTCAACGAAGCCTCC 
ValLeuHisAsnLeuArgGluArgTyrPheSerGlyLeuIleTyrThrTyrSerGlyLeu 

1 1 1 1 1 1 1 1 1 1 1 1 1 1! 1 1 il I ' II 1 1 II :! II 1 1 il M 1 1 1 1 ! I II 1 1 1 1 1 1 1 1 1 1 : 1 1 1 1 

GTGCTACACAACCTGAGGGAGCGGTACTTCTCAGGGCTAATATATACGTACTCTGGCCTC 
PheCysValValValAsnProTyrLysHisLeuProIleTyrSerGluLysIleValAsp 

MMM M III II MM II MMHI MINIM MUM II MIHIMIMMI IMI 



MetTyrLysGlyLysLysArgHisGluMetProProHisIleTyrAlalleAlaAspThr 160 

1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 

ATGTACAAGGGCAAGAAGAGGCACGAGATGCCGCCTCACATCTACGCCATCGCAGACACG 555 



I M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 i 1 1 1 : 1 1 M I 



lllllllll Mil MIIIMI Mil llllll IMMMIMI Mill lllllllllllll 

GGAGCCGGGAAAACCGAAAACACCAAGAAGGTCATTCAGTACCTGGCCGTGGTGGCCTCC 675 



IIMIIII III II MIIIIMMII MIIMI llllll 

TCCCACAAGGGCAAGAAAGACACAAGTATCACG GGAGAG 714 



Mllllll II I M 1 1 1 1 1 1 M 1 1 ! 1 1 1 i 1 1 llllll 1 1 lllllllll llllll 



ValLysAsnAspAsnSerSerArgPheGlyLysPhelleArglleAsnPheAspValThr 260 

MIMhlll 1 1 1 hll II illill IMIIill Ihlll 1 1 lilill IIU iMill IM 

GTGAAGAACGACAACTCCTCACGATTCGGCAAATTCATCCGCATCAACTTCGACGTCACG 834 
GlyTyrlleValGlyAlaAsnlleGluThrTyrLeuLeuGluLysSerArgAlalleArg 28 0 



US-09- 
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Qy 


1 


Db 


76 


Qy 


21 


Db 


136 


Qy 


41 


Db 


196 


Qy 


61 


Db 


256 


Qy 


81 


Db 


316 


Qy 


101 


Db 


376 


Qy 


121 


Db 


436 


Qy 


141 


Db 


496 


Qy 


161 


Db 


556 


Qy 


181 


Db 


616 


Qy 


201 


Db 


676 


Qy 


221 


Db 


715 


Qy 


241 


Db 


775 


Qy 


261 



Db 


835 


IN MINIM II llllllllllll MINIMI MUM II llllllllllllllllll 

GGTTACATCGTGGGAGCC AACATTGAGAC CT ATCTGCT AG AAAAATCACGGG CAATT CG C 


894 


Qy 

Db 


281 
895 


GlnAlaArgAspGluArgThrPheHisIlePheTyrTyrMetlleAlaGlyAlaLysGlu 

MMMIIIIMIINIIIIIIII!MIIIIIIIIIIIININNI llllllllllll 1 

CAAGCCAGAGACGAGAGGACATTCCACATCTTTTACTACATGATTGCTGGAGCCAAGGAG 


300 
954 


Qy 


301 


LysMetArgSerAspLeuLeuLeuGluGlyPheAsnAsnTyrThrPheLeuSerAsnGly 

1 M Mill il IIIIIIIMIIIII IMIIII 1 HUM II llllllllllllllllll 

AAGATGAGAAGTGACTTGCTTTTGGAGGGCTTCAACAACTACACCTTCCTCTCCAATGGC 


320 


Db 


955 


1014 


Qy 


321 


PheValProIleProAlaAlaGlnAspAspGluMetPheGlnGluThrValGluAlaMet 

in iiiiiiniiii mm iinii ii iiiii iiiiiiiiiiiiiiiiiiiiiiiiini 


340 


Db 


1015 


TTTGTGCCCATCCCAGCAGCCCAGGATGATGAGATGTTCCAGGAAACCGTGGAGGCCATG 


1074 


Qy 

Db 


341 
1075 


AlalleMetGlyPheSerGluGluGluGlnLeuSerlleLeuLysValValSerSerVal 

iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiniiiiiiiiiiiiiiii 

GCAATCATGGGTTTCAGCGAGGAGGAGCAGCTATCCATATTGAAGGTGGTATCATCGGTC 


360 
1134 


Qy 


361 


LeuGlnLeuGlyAsnlleValPheLysLysGluArgAsnThrAspGlnAlaSerMetPro 

1 llllllllllll llllllllllll III INI llllllllllllllllll IMIIII III 

CTGCAGCTTGG AAATATCGTCTTCAAG AAGGAAAG AAACACAGACCAGG CGTC CATGCCA 


380 


Db 


1135 


1194 


Qy 

Db 


381 
1195 


AspAsnThrAlaAlaGlnLysValCysHisLeuMetGlylleAsnValThrAspPheThr 

1 Mil II INI II IINII MINI INI MM II II II 1 II M II M II II 1 II 1 1 Ml 

GATAACACAGCTGCTCAGAAAGTTTGCCACCTCATGGGAATTAATGTGACAGATTTCACC 


400 
1254 


Qy 


401 


ArgSerlleLeuThrProArglleLysValGlyArgAspValValGlnLysAlaGlnThr 

1 1 II II 1 1 1 M 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 M 1 1 1 1 II II 1 1 1 1 1 1 1 1 1 1 1 II 1 M 1 II 1 1 1 1 
AGATCCATCCT C ACTCCTCGT AT CAAGGTTGGGCGAG ATGTGGTACAG AAAG CT CAGAC A 


420 


Db 


1255 


1314 


Qy 


421 


LysGluGlnAlaAspPheAlaValGluAlaLeuAlaLysAlaThrTyrGluArgLeuPhe 

llllllll II MM MM MUM III MM II MMIM III! IMIIIMI II II 

AAAGAACAGGCTGACTTTGCTGTAGAGGCTTTGGCCAAGGCAACATATGAGCGCCTTTTC 


440 


Db. 


1315 


1374 


Qy 

Db 


441 
1375 


ArgTrpIleLeuThrArgValAsnLysAlaLeuAspLysThrHisArgGlnGlyAlaSer 

1 1 1 1 1 II 1 1 II 1 1 M 1 1 1 1 1 II 1 1 1 1 II 1 M M 1 II 1 1 1 1 1 M II 1 1 II 1 1 1 1 1 1 1 1 M 1 
CGCTGGATACTCACCCGCGTGAACAAAGCCCTGGACAAGACCCATCGGCAAGGGGCTTCC 


460 
1434 


Qy 


461 


PheLeuGlylleLeuAspIleAlaGlyPheGluIlePheGluValAsnSerPheGluGln 

lllllllllll IIIIIIIMIIIII llllllll lllllll II llllllllllllllllll 

TTCCTGGGGATCCTGGATATAGCTGGATTTGAGATCTTTGAGGTGAACTCCTTCGAGCAG 


480 


Db 


1435 


1494 


Qy 


481 


LeuCysIleAsnTyrThrAsnGluLysLeuGlnGlnLeuPheAsnHisThrMetPhelle 

INNNNIINIIIINMIMIMMINNIMNIMMMMIMNMIMM 


500 


Db 


1495 


CTGTGCATCAACTACACCAACGAGAAGCTGCAGCAGCTCTTCAACCACACCATGTTCATC 


1554 


Qy 


501 


LeuGluGlnGluGluTyrGlnArgGluGlylleGluTrpAsnPhelleAspPheGlyLeu 

MM MM llllllllll IMMIIII Mil II IMIIII IIMMMIIIIII II II 1 

CTGG AGC AGGAGGAGTACCAG CGCG AGGGCAT CG AGTGGAACTTCAT CG ACTTTGGGCTG 


520 


Db 


1555 


1614 


Qy 


521 


AspLeuGlnProCysIleGluLeuIleGluArgProAsnAsnProProGlyValLeuAla 

NNIIMINNNNNINNNMININMINNNNMNIMMNNII 


540 


Db 


1615 


GACCTACAGCCCTGCATCGAGCTCATCGAGCGACCGAACAACCCTCCAGGTGTGCTGGCC 


1674 


Qy 


541 


LeuLeuAspGluGluCysTrpPheProLyaAlaThrAspLysSerPheValGluLysLeu 

1 1 N II II II 1 1 II 1 II 1 1 II M 1 N 1 1 1 M 1 N 1 II II 1 II II II II II II N N 1 1 1 1 

CTGCTGGACGAGGAATGCTGGTTCCCCAAAGCCACGGACAAGTCTTTCGTGGAGAAGCTG 


560 


Db 


1675 


1734 


,Qy 


561 


CysThrGluGlnGlySerHisProLysPheGlnLysProLysGlnLeuLysAspLysThr 

INNNNIINIIMMMINIMNMINIMMIMMMMINIININN 

TGCACGGAGCAGGGCAGCCACCCCAAGTTCCAGAAGCCCAAGCAGCTCAAGGACAAGACT 


580 


Db 


1735 


1794 


Qy 


581 


GluPheSerllelleHisTyrAlaGlyLysValAspTyrAsnAlaSerAlaTrpLeuThr 

1 1 1 1 1 1 1 1 1 1 1 1 II 1 II II M 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 II 1 
GAGTTCTCCATCATCCATTATGCTGGGAAGGTGGACTATAATGCGAGTGCCTGGCTGACC 


600 


Db 


1795 


1854 


Qy 


601 


LysAsnMetAspProLeuAsnAspAsnValThrSerLeuLeuAsnAlaSerSerAspLys 

lllllllll II NININNIMIMNMNMINII II lllllllllll IMIIII 

AAGAATATGGACCCGCTGAATGACAACGTGACTTCCCTGCTCAATGCCTCCTCCGACAAG 


620 


Db 


1855 


1914 



Qy 


621 


PheValAlaAspLeuTrpLysAspValAspArglleValGlyLeuAspGlnMetAlaLys 

MMMMIMMMIMIMM MIIMI II IMMIII IMIIII MM IMIIIIM 

TTTGTGGCCGACCTGTGGAAGGACGTGGACCGCATCGTGGGCCTGGACCAGATGGCCAAG 


640 


Db 


1915 


1974 


Qy 


641 


MetThrGluSerSerLeuProSerAlaSerLysThrLysLysGlyMetPheArgThrVal 

IIIIIMIINIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIII 

ATGACGGAGAGCTCGCTGCCCAGCGCCTCCAAGACCAAGAAGGGCATGTTCCGCACAGTG 


660 


Db 


1975 


2034 


Qy 


661 


GlyGlnLeuTyrLysGluGlnLeuGlyLysLeuMetThrThrLexiArgAsnThrThrPro 

IIIIMIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIMIIIIIIIIIIIIIIIIMIII 

GGGCAGCTGTACAAGGAGCAGCTGGGCAAGCTGATGACCACGCTACGCAACACCACGCCC 


680 


Db' 


2035 


2094 


Qy 


681 


AsnPheValArgCysIlelleProAsnHisGluLysArgSerGlyLysLeuAspAlaPhe 

IMMMMIIII IMIIMI 1 1 1 IIMIMI IMIIMI IMMM IM 1 IMIIMM 

AACTTCGTGCGCTGCATCATCCCCAACCACGAGAAGAGGTCCGGCAAGCTGGATGCGTTC 


700 


Db 


2095 


2154 


Qy 


701 


LeuValLeuGluGlnLeuArgCysAsnGlyValLeuGluGlylleArglleCysArgGln 

IliiMIIIMIIMIMMI 1 MMMIMlMIMM IMIMIM 1 1 1 Mill MM 

CTGGTGCTGGAGCAGCTGCGGTGCAATGGGGTGCTGGAAGGCATTCGCATCTGCCGGCAG 


720 


Db 


2155 


2214 


Qy 

Db 


721 
2215 


GlyPheProAsnArglleValPheGlnGluPheArgGlnArgTyrGluIleLeuAlaAla 

IIIIIIIMIIIIMIIIIIIIIIMIMIIIIIMMIIIIIIIIIIIIMIIMIIII 

GGCTTCCCCAACCGGATCGTCTTCCAGGAGTTCCGCCAACGCTACGAGATCCTGGCGGCG 


740 
2274 


Qy 


741 


AsnAlal leProLysGlyPheMet AspGlyLysGlnAlaCysI leLeuMet IleLysAla 

NNNNNIIIIMIN Mill MINIM llllllll MINN III IIIMIIMI 

AATGCCAT CC CC AAAGGCTTCATGGACGGGAAGCAGGCCTGCATT CT CATGAT C AAAG C C 


760 


Db 


2275 


2334 


Qy 


761 


LeuGluLeiaAspProAsnLeuTyrArglleGlyGlnSerLysIlePhePheArgThrGly 

IIIIMMIIIII IMMMi III II INI II MINIM MINN 1 II IMMIIMI 

CTGGAACTTGACCCCAACTTATACAGGATAGGGCAGAGCAAAATCTTCTTCCGAACTGGC 


780 


Db 


2335 


2394 


Qy 

Db 


781 
2395 


ValLeuAlaHisLeuGluGluGluArgAspLeuLysIleThrAspVallleMetAlaPhe 

IIIIIMMIIIMMMIIIIIIIIIIIIIIIIIIIIMIIII MM llllllll MM 

GTCCTGGC CCAC CTAG AGG AGG AGCG AGATTTG AAGATC AC CGATGTC ATC ATGG C CTTC 


800 
2454 


Qy 


801 


GlnAlaMetCysArgGlyTyrLeuAlaArgLysAlaPheAlaLysArgGlnGlnGlnLeu 

mMM 1 1 1 M 1 1 1 M IMM M 1 1 Ml II i 1 M III M 1 Ml II 1 M 1 1 II 1 1 M II II 

CAGGCGATGTGTCGTGGCTACTTGGCCAGAAAGGCTTTTGCCAAGAGGCAGCAGCAGCTG 


820 


Db 


2455 


2514 


Qy 


821 


ThrAlaMetLysVallleGlnArgAsnCysAlaAlaTyrLeuLysLeuArgAsnTrpGln 

II II II II Ml M M 1 1 MM M 1 M Ml 1 1 1 MM i M II 1 Ml 1 M M M 1 !M MMI 

ACCGCCATGAAGGTGATTCAGAGGAACTGCGCCGCCTACCTCAAGCTGCGGAACTGGCAG 


840 


Db 


2515 


2574 


Qy 


841 


TrpTrpArgLeuPheThrLysValLysProLeuLeuGlnValThrArgGlnGluGluGlu 

MMMIIIIIIIIMMMII IMIIMMIMIMM Mlllllll llllllll III 

TGGTGGAGGCTTTTCACCAAAGTGAAGCCACTGCTGCAGGTGACACGGCAGGAGGAGGAG 


860 


Db 


2575 


2634 


Qy 


861 


MetGlnAlaLysGluAspGluLeuGlnLysThrLysGluArgGlnGlnLysAlaGluAsn 

llllllll III MM MIIMI MINI II MINIM MINI Nil IIIMIIMI 

ATGCAGG C C AAGGAGG ATG AACTG C AGAAG ACCAAGGAGCGG C AGCAG AAGG C AGAGAAT 


880 


Db 


2635 


2694 


Qy 

Db 


881 
2695 


GluLeuLysGluLeuGluGlnLysHisSerGlnLeuThrGluGluLysAsnLeuLeuGln 

MMMMMIIMI MIMM IIIIIIIIIIIIIIIIIMIII II M MIIMM III 

GAGCTT AAGGAG CTGG AAC AG AAGCACT CGCAGCTGACCG AGGAGAAGAACCTGCT ACAG 


900 
2754 


Qy 


901 


GluGlnLeuGlnAlaGluThrGluLeuTyrAlaGlviAlaGluGluMetArgValArgLeu 

i.MIIIIIIIIII 1 1 II MM MMMMMMIMM MIMM 1 Mlllllll MM 

GAACAGCTGCAGGCAGAGACAGAGCTGTATGCAGAGGCTGAGGAGATGCGGGTGCGGCTG 


920 


Db 


2755 


2814 


Qy 


921 


AlaAlaLysLysGlnGluLeuGluGluIleLeuHisGluMetGluAlaArgLeuGluGlu 

til: Ill INI llllllll II llllllll MINI IMINNII Nil 

GCGG CCAAGAAGC AGGAG CTGGAGG AG AT ACTGC ATG AGATGG AGGC CCG C CTGGAGGAG 


940 


Db 


2815 


2874 


Qy 


941 


GluGluAspArgGlyGlnGlnLeuGlnAlaGliiArgLysLysMetAlaGlnGlnMetLeu 

1 1 1 II II M M 1 M 1 1 1 Ml Ml 1 M II II II llllllll MM III III MM llllll 

GAGGAAG ACAGGGG C C AGCAGCT AC AGGCTGAAAGGAAGAAGATGGCCC AGC AGATGCTG 


960 


Db 


2875 


2934 


Qy 
Db 


961 
2935 


AspLeuGluGluGlnLeuGluGluGluGluAlaAlaArgGlnLysLeuGlnLeuGluLys 

IMMIMMIMI II llllllll MMM II IMMIII MIMM III MMM MM 

GACCTTGAAGAACAGCTGGAGGAGGAGGAAGCTGCCAGGCAGAAGCTGCAACTTGAGAAG 


980 
2994 



Qy 


981 


ValThrAlaGluAlaLysIleLysLysLeuGluAspGluIleLeuValMetAspAspGln 

IIIIIIIMIMilMIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIMMII 

GTCACGGCTGAGGCCAAGATCAAGAAACTGGAGGATGAGATCCTGGTCATGGATGATCAG 


1000 


Db 


2995 


3054 


Qy 


1001 


AsnAsnLysLeuSerLysGluArgLysLeuLeuGluGluArglleSerAspLeuThrThr 

1 1 1 1 11 h II <l II II 1 1 M 1 1 1 1 1 1 1 1 II M 1 1 1 1 1 h II 1 1 1 1 1 1; 1 1 >M r 1 II 1 1 

AAC AAT AAACTAT C AAAAG AACGAAAACTC CTTG AGG AGAGGATT AGTGACTT AACG AC A 


1020 


Db 


3055 


3114 


Qy 


102i 


AsnLeuAlaGluGluGluGluLysAlaLysAsnLeuThrLysLeuLysAsnLysHisGlu 

1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

AAT CTTGC AGAAGAGGAAGAAAAGG C CAAGAATCTTACCAAGCTGAAAAACAAGC ATGAA 


1040 


Db 


3115 


3174 


Qy 


1041 


SerMetlleSerGluLeuGluValArgLeuLysLysGluGluLysSerArgGlnGluLeu 

h II 1 1 1 1 M 1 1 1 ; 1 1 M II II M 1 hill 1 1 i 1 II 1 1 1 1 1 1 1 h: 1 1 1 1 1 M 1 1 1 1 1 1 1 

TCTATGATTTCAGAACTGGAAGTGCGGCTAAAGAAGGAAGAGAAGAGCCGACAGGAGCTG 


1060 


Db 


3175 


3234 


Qy 


1061 


GluLysLeuLysArgLysLeuGluGlyAspAlaSerAspPheHisGluGlnlleAlaAsp 

llllllll llllll II llllllllllllllllllll II 11111111111111 llllllll 

GAGAAG CTGAAACGGAAGCTGGAGGGTG ATGCCAGCGACTT CC ACG AGC AGATCGCTG AC 


1080 


Db 


3235 


3294 


Qy 


1081 


LeuGlnAlaGlnlleAlaGluLeuLysMetGlnLeuAlaLysLysGluGluGluLeuGln 

llllll 1 Ml II II II II Mllllllllll llllll II 1 MIIMI IIIIIMIII II 1 

CTCC AGGCGC AGAT CGC AG AGCTC AAGATGCAGCTGGC C AAGAAGG AGGAGGAG CTG C AG 


1100 


Db 


3295 


3354 


Qy 


1101 


AlaAlaLeuAlaArgLeuAspAspGluIleAlaGlnLysAsnAsnAlaLeuLysLysIle 

Mil IIIIIMIII Mil IIMIIllllllllllllllllllllllllllllll II II II 

GCGGCCCTGGCCAGGCTTGACGATGAAATCGCTCAGAAGAACAATGCCCTGAAGAAGATC 


1120 


Db 


3355 


3414 


Qy 

Db 


1121 
3415 


ArgGluLeuGluGlyHisIleSerAspLeuGlnGluAspLeuAspSerGluArgAlaAla 

M 1 1 1 M 1 M M 1 1 II II 1 1 M II 1 1 II II II II 1 II 1 1 II II II 1 II II II II 1 1 1 1 II 

CGGGAGCTGGAGGGCCACATCTCAGACCTCCAGGAGGACCTGGACTCAGAGCGGGCCGCC 


1140 
3474 


Qy 


1141 


ArgAsnLysAlaGluLysGlnLysArgAspLeuGlyGluGluLeuGluAlaLeuLysThr 

llllll IMIIMI MM IMMMMMMIMM II Mllllllllll MM II II II 

AGGAACAAGGCTGAAAAGCAGAAGCGAGACCTCGGCGAGGAGCTGGAGGCCCTAAAGACA 


1160 


Db 


3475 


3534 


Qy 


1161 


GluLeuGluAspThrLeuAspSerThrAlaThrGlnGlnGluLeuArgAlaLysArgGlu 

MM II II MM II MM IMMIIMIMMMMII Mllllllllll MM Nil II 

GAGCTGGAAGACACACTGGACAGCACAGCCACTCAGCAGGAGCTCAGGGCCAAGAGGGAG 


1180 


Db 


3535 


3594 


Qy 


1181 


GlnGluValThrValLeuLysLysAlaLeuAspGluGluThrArgSerHisGluAlaGln 

II II MM IIIIIMIII II II IIIIMIIIMIII II llllll llllll II IMIIMI 

CAGGAGGTGACGGTGCTGAAGAAGGCCCTGGATGAAGAGACGCGGTCCCATGAGGCTCAG 


1200 


Db 


3595 


3654 


Qy 


1201 


ValGlnGluMetArgGlnLysHisAlaGlnAlaValGluGluLeuThrGluGlnLeuGlu 

MM II II II II II II IMIIIIMIIMIIMIIIIIIIIIMIIIIMII IMIIMI 

GTCCAGG AGATG AGGC AGAAACACGC ACAGGCGGTGG AGG AG CT CAC AGAGC AG CTTG AG 


1220 


Db 


3655 


3714 


Qy 


1221 


GlnPheLysArgAlaLysAlaAsnLeuAspLysAsnLysGlnThrLeuGluLysGluAsn 

llllll n it ii ii mi i : 1 1 1 M 1 1 1 1 1 1 1 1 1 1 ;< 1 1 1 1 1 1 1 iiiiiiini n ii 

C AGTT CAAGAGGGCCAAGGCG AAC CT AG ACAAGAATAAGCAGACGCTGGAGAAAG AG AAC 


1240 


Db 


3715 


3774 


Qy 


1241 


AlaAspLeuAlaGlyGluLeuArgValLeuGlyGlnAlaLysGlnGluValGluHisLys 

1: II 1 1 II illllill M II II il IhMlllli; i llllllll MINI Ihilihl 

GCAGACCTGGCCGGGGAGCTGCGGGTCCTGGGCCAGGCCAAGCAGGAGGTGGAACATAAG 


1260 


Db 


3775 


3834 


Qy 


1261 


LysLysLysLeuGlviAlaGlnValGlnGluLeuGlnSerLysCysSerAspGlyGluArg 

IMIIMI IMIIMI M 1 1 1 1 1 i 1 1 1 1 ! 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 IMIIMI 

AAGAAG AAGCTGG AGG CGCAGGTGCAGGAG CTGCAGTCC AAGTGCAGCGATGGGGAGCGG 


1280 


Db 


3835 


3894 


Qy 


1281 


AlaArgAlaGluLeuAsnAspLysValHisLysLeuGlnAsnGluValGluSerValThr 

Nil 1 III llllllllllllllllllllllllll IIIIIMIII llllll IIIIIMIII 

GCC CGGGCGGAGCTC AATGAC AAAGTCCAC AAG CTGCAG AATG AAGTTGAG AGCGTCACA 


1300 


Db 


3895 


3954 


Qy 


1301 


GlyMetLeuAsnGluAlaGluGlyLysAlalleLysLeuAlaLysAspValAlaSerLeu 

II II II II III II Ml II II II II II MM MM II IMIIMI llllll IIIIIMIII 

GGGATGCTTAACGAGGCCGAGGGGAAGGCCATTAAGCTGGCCAAGGACGTGGCGTCCCTC 


1320 


Db 


3955 


4014 


Qy 


1321 


SerSerGlnLeuGlnAspThrGlnGluLeuLeuGlnGluGluThrArgGlnLysLeuAsn 


1340 



mi iiiiiiiuii ii ii ii ii i mi 1 1 iii iiiiiiiiiiiiiiiiiinnnii 



Db 


4015 


Qy 


1341 


Db 


4075 


Qy 


1361 


Db 


4135 


Qy 


1381 


Db 


4195 


Qy 


1401 


Db 


4255 


Qy 


1421 


Db 


4315 


Qy 


1441 


Db 


4375 


Qy 


1461 


Db 


4435 


Qy 


1481 


Db 


4495 


Qy 


1501 


Db 


4555 


Qy 


1521 




f 


Db 


4TS15 


Qy 


1541 


Db 


4675 


Qy 


1561 


Db 


4735 


Qy 


1581 


Db 


4795 


Qy 


1601 


Db 


4855 


Qy 


1621 


Db 


4915 


Qy 


1641 


Db 


4975 


Qy 


1661 


Db 


5035 


Qy 


1681 



4015 AGTTCCCAGCTCCAGGACACCCAGGAGCTGCTTCAAGAAGAAACCCGGCAGAAGCTCAAC 4074 



INI III llllllllllllll llllll II MINIM I IINIIMM llllll Mill 



III IMMMMMMMIMM II lllllll I IMIIII II IMIIMI 



SerAspSerLysLysLysLeuGlnAspPheAlaSerThrValGluAlaLeuGluGluGly 14 00 

I llllllllllllll lllllllll llllll I llllllllllllll II IIIIIMI Mill 

TCCGACTCGAAGAAGAAGCTGCAGGACTTTGCCAGCACCGTGGAAGCTCTGGAAGAGGGG 4254 



MM MMMMIMMMM IMIMM I lilllll I MIIIMI MUM MM I M 

AAGAAG AGGTT CCAGAAGG AG ATCGAGAAC CT CACC CAGCAGTACGAGGAGAAGG CGGCC 4314 
AlaTyrAspLysLeuGluLysThrLysAsnArgLeuGlnGlnGluLeuAspAspLeuVal 1440 

MM IMMIMIM IMMI IMMIII I IMIMI I IMMMMMMMM 1 1 M 

G CTT ATG AT AAACTGGAAAAG ACCAAG AACAGGCTT CAG C AGGAGCTGGACG AC CTGGTT 43 74 
ValAspLeuAspAsnGlnArgGlnLeuValSerAsnLeuGluLysLysGlnArgLysPhe 14 60 

IMIMM MIMMMMI IMMIII I MMMI I IMIIMI MMMMM I II 

GTTGATTTGGACAACCAGCGGCAACTCGTGTCCAACCTGGAAAAGAAGCAGAGGAAATTT 4434 
AspGlnLeuLeuAlaGluGluLysAsnlleSerSerLysTyrAlaAspGluArgAspArg 14 80 

I MM III llllllllllllllllllllll MINIM IMIIII II II IIIIIMI II I 

GATCAGTTGTTAGCCGAGGAGAAAAACATCTCTTCCAAATACGCGGATGAGAGGGACAGA 44 94 



Mllllllllllll lllllllll I llllll I IMIIII II IMIIMI 



GluAlaLeuGluAlaLysGluGluLeuGluArgThrAsnLysMetLeuLysAlaGluMet 1520 

1 1 II II 1 1 M 1 1 1 II :M I M I IMMI 1 1 MM M M I II 1 1 M 1 1 1 1 II I M 1 1 1 

GAGGCCTTGG AAGC C AAAGAGGAACTCG AG CGG ACCAACAAAATGCT CAAAG C CG AAATG 4614 



I II M 1 1 M I M M II 1 1 1 1 1 1 1 MM 1 1 1 1 IM M 1 1 1 Ml Ml I M MM 1 1 M M 

415. 1 5 GAAGACCTGGTCAGCTCCAAGGATGACGTGGGCAAGAACGTCCATGAGCTGGAGAAGTCC 4674 



I 1 1 M III I 1 1 III I 1 1 1 1 M 1 1 1 1 II 1 1 1 1 1 II II I I 1 1 MM I 1 1 1 1 M 1 1 1 1 1 1 M 

AAGCGGGCCCTGGAGACCCAGATGGAGGAGATGAAGACGCAGCTGGAAGAGCTGGAGGAC 
GluLeuGlnAlaThrGluAspAlaLysLeuArgLeuGluValAsnMetGlnAlaLeuLys 

I llllllllllllll llllll IMMMMMIIMM IMIIII II IIIIIMI Mill 

GAGCTGCAAGCCACGGAGGACGCCAAACTGCGGCTGGAAGTCAACATGCAGGCGCTCAAG 
GlyGlnPheGluArgAspLeuGlnAlaArgAspGluGlnAsnGluGluLysArgArgGln 

IIIIIIIIIIIIIIIIIMIIIIIIIIIMM IMIMMMMIMIIIIIIMMII 



I II INI II II MM MINI III IIIIIMI II II NIIIIMI II llllll llllll I 

CTGCAGAGACAGCTTCACGAGTATGAGACGGAACTGGAAGACGAGCGAAAGCAACGTGCC 4914 



I MM III IIIIIMI llllll IMMIII IMMMI IMMil II il IMIIMI II 

CTGGCAGCTGCAGCAAAGAAGAAGCTGGAAGGGGACCTGAAAGACCTGGAGCTTCAGGCt 
AspSerAlalleLysGlyArgGluGluAlalleLysGlnLeuArgLysLeuGlnAlaGli 

I llllllllllllll llllllllllllllllllllll I lllllllll IIIIIMI Nil 



MetLysAspPheGlnArgGluLeuGluAspAlaArgAlaSerArgAspGluIlePheAla 1680 

I II M IMMMIIMMIIIMMMMMI II M 1 1 MIIIIMI IMMI III MM 

ATGAAGGACTTTCAAAGAGAGCTGGAAGATGCCCGTGCCTCCAGAGATGAGATCTTTGCC 5094 
ThrAlaLysGluAsnGluLysLysAlaLyaSerLeuGluAlaAspLeuMetGlnLeuGln 1700 



Mill lllllllllllilllllllllllllllllll lilllllllllllllllllllll 

9 5 ACAGCCAAAGAGAATGAGAAGAAAGCCAAGAGCTTGGAAGCAGACCTCATGCAGCTACAA 



Db 


5095 


ACAG CCAAAG AGAATGAG AAG AAAGCCAAGAGCTTGGAAGC AGAC CTCATGCAG CT AC AA 


5154 


Qy 


1701 


GluAspLeuAlaAlaAlaGluArgAlaArgLysGlnAlaAspLeuGluLysGluGluLeu 

IIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIMIIIIIIIIIIIIIM 

G AGG ACCT CG C CGCCGCTGAG AGGGCTCGCAAACAAGCGGAC CT CG AGAAGGAGGAACTG 


1720 


Db 


5155 


5214 


Qy 


1721 


AlaGluGluLeuAlaSerSerLeuSerGlyArgAsnAlaLeuGlnAspGluLysArgArg 

MIM MUM MMMMMMMMMMIMM MIMI IMMIIMI IMIIIIII 


1740 


Db 


5215 


GCAGAGGAG CTGG CC AGT AGC CTGTCGGG AAGG AACGCACTCCAGGACG AG AAGCGCCGC 


5274 


Qy 


1741 


LeuGluAlaArglleAlaGlnLeuGluGluGluLeuGluGluGluGlnGlyAsnMetGlu 

inn linn ii iiiiiiiiiiiiiii iiiiiin niiiiii iiiiiii iniiiii 

CTGGAGGCC CGG ATCGCCCAG CTGG AGGAGG AGCTGGAGGAGGAGCAGGGC AAC ATGG AG 


1760 


Db 


5275 


5334 


Qy 


1761 


AlaMetSerAspArgValArgLysAlaThrGlnGlnAlaGluGlnLeuSerAsnGluLeu 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
GCCATGAGCGACCGGGTCCGCAAAGCCACACAGCAGGCCGAGCAGCTCAGCAACGAGCTG 


1780 


Db 


5335 


5394 


Qy 


1781 


AlaThrGluArgSerThrAlaGlnLysAsnGluSerAlaArgGlnGlnLeuGluArgGln 

MM M MM M M Mi M M MM M M M M M MMMM M M M M M M M M 1 

GCC AC AG AGCGCAGC ACGGCCCAGAAGAATG AGAGTG CCCGGCAG CAGCTCGAG CGGC AG 


1800 


Db 


5395 


5454 


Qy 

Db 


1801 
5455 


AsnLysGluLe^rgSerLysLeuHisGluMetGluGlyAlaValLysSerLysPheLys 

MM II Mill MMMMMMMIMMIMM IIIIIII II IIIIIII II II 11 III 

AACAAGGAGCTCCGGAGCAAGCTCCACGAGATGGAGGGGGCCGTCAAGTCCAAGTTCAAA 


1820 
5514 


Qy 


1821 


SerThrlleAlaAlaLeuGluAlaLysIleAlaGlnLeuGluGluGlnValGluGlnGlu 

Mill 1 III II MMMMMMMIMMIMM IIIIIII IIIIIIIIIIIIIII III 
T CCACC AT CGCGG CGCTGG AGGCCAAG ATTGC ACAGCTAGAGGAGC AGGT CG AGC AGGAG 


1840 


Db 


5515 


5574 


Qy 


1841 


AlaArgGluLysGlnAlaAlaThrLysSerLeuLysGlnLysAspLysLysLeuLysGlu 

Mill 1 Mill MMMMMMMIMMIMM MMMM IIIIIII MUM III 

GCCAGAGAG AAAC AGGCGGCC ACCAAGT CGCTG AAG CAGAAAGACAAGAAG CTGAAGGAA 


1860 


Db 


5575 


5634 


Qy 


1861 


IleLeuLeuGlnValGluAspGluArgLysMetAlaGluGlnTyrLysGluGlnAlaGlu 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIII 


1880 


DO 




t: c qa 
d d y*± 


Qy 


1881 


LysGlyAsnAlaArgValLysGlnLeuLysArgGlnLeuGluGluAlaGluGluGluSer 

Mill 1 Mill MIMMMMMMMIMMMI MMMM IIIIIII II II II III 

AAAGGC AATGC C AGGGTCAAGC AGCTC AAG AGGCAGCTGGAGGAGGCAGAGG AGG AGTC C 


1900 


Db 


5695 


5754 


Qy 


1901 


GlnArglleAsnAlaAsnArgArgLysLeuGlnArgGluLeuAspGluAlaThrGluSer 

MM II Mill MMMMMMMIMMIMM IIIIIII II IIIIIII MINI III 

CAGCGC ATC AACG CC AACCGC AGGAAG CTGC AGCGGGAGCTGGATG AGGC C ACGG AG AG C 


1920 


Db 


5755 


5814 


Qy 


1921 


AsnGluAlaMetGlyArgGluValAsnAlaLeuLysSerLysLeuArg 1936 

MMIIMMIMMMMMMMMMMIMMMIMIMIMI 

AACG AGGC CATGGGCCGCGAGGTGAACGCACTCAAGAGCAAGCTC AGG 5862 




Db 


5815 





GenPept Accession Number P35749 
MYHB HUMAN 



ID MYHB_HUMAN STANDARD; PRT; 1972 AA. 

AC P35749; 000396; 094944; P78422; 

DT 01-JUN-1994 (Rel. 29, Created) 

DT 16 -OCT- 2001 (Rel. 40, Last sequence update) 

DT 15 -MAR- 2004 (Rel. 43, Last annotation update) 

DE Myosin heavy chain, smooth muscle isoform (SMMHC) . 

GN MYH11 OR KIAA0866. 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 
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RN [5] 

RP SEQUENCE OF 1093-1972 FROM N.A. 

RC TISSUE=Hippocampus ; 

RA Okajima K. ; 

RL Submitted (NOV-1992) to the embl/ GenBank/DDB J databases. 
CC FUNCTION: Muscle contraction. 

CC -!- SUBUNIT: Muscle myosin is a hexameric protein that consists of 2 

CC heavy chain subunits (MHC) , 2 alkali light chain subunits (MLC) 

CC and 2 regulatory light chain subunits (MLC-2) . 

CC -!- SUBCELLULAR LOCATION: Thick filaments of the myofibrils. 

CC -!- TISSUE SPECIFICITY: Smooth muscle; expressed in the umbilical 

CC artery, bladder, esophagus and trachea. 

CC -!- DOMAIN: The rodlike tail sequence is highly repetitive, showing 

CC cycles of a 28 -residue repeat pattern composed of 4 hep tapept ides, 

CC characteristic for alpha-helical coiled coils. 

CC -!- DISEASE: A chromosomal rearrangement, known as pericentric 

CC inversion Inv(16) (pl3;q22). Produces a fusion protein that 

CC consists of the 165 N- terminal residues of CBF-beta (PEPB2) with 

CC the tail region of MYH11. This rearrangement is associated with 

CC acute myeloid leukemia of M4E0 subtype. 

CC - ! - MISCELLANEOUS : Each myosin heavy chain can be split into 1 light 
CC meromyosin (LMM) and 1 heavy meromyosin (HMM) . It can later be 

CC split further into 2 globular subfragments (SI) and 1 rod-shaped 

CC subfragment (S2) . 

CC -!- SIMILARITY: Contains 1 myosin-like globular head domain. 
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-!- SIMILARITY: Contains 1 IQ domain. 

-!- DATABASE: NAME=Atlas Genet. Cytogenet . Oncol. Haematol.; 

WWW="http : //www. infobiogen.fr/services/chromcancer/Genes/MYHHID43 .html" 

This SWISS-PROT entry is copyright. It is produced through a collaboration 
between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 
the European Bioinf ormatics Institute. There are no restrictions on its 
use by non-profit institutions as long as its content is in no way 
modified and this statement is not removed. Usage by and for commercial 
entities requires a license agreement (See http://www.isb-sib.ch/announce/ 
or send an email to license@isb-sib. ch) . 

EMBL; AF001548; AAC31665.1; 

EMBL; U91323; AAC35212.1; 

EMBL; AB020673; BAA74 889.2; ALT_INIT . 

EMBL; D10667; -; NOT_ANNOTATED_CDS . 

EMBL; X69292; CAA4 9154.1; -. 

HSSP; P10587; 1BR2 . 

Genew; HGNC:7569; MYH11. 

MIM; 160745; -. 

GO; GO: 0005859; C:muscle myosin; TAS . 
InterPro; IPR000048; IQ_region. 
Inter Pro; I PRO 016 09; myosin_head. 
InterPro; IPR004009; Myosin_N. 
InterPro; I PRO 02 92 8; Myosin_tail. 
InterPro; IPR002017; Spectrin. 
Pfam; PF00612; IQ; 1. 
Pfam; PF00063; myosin_head; 1. 
Pfam; PF02736; Myosin_N; 1. 
Pfam; PF01576; Myosin_tail; 1. 
PRINTS; PR00193; MYOSINHEAVY . 
ProDom; PD000355; myosin_head; 1. 
SMART; SM00015; IQ; 1. 
SMART; SM00242; MYSC; 1. 
PROSITE; PS50096; IQ; 1. 

Myosin; Muscle protein; Coiled coil; Thick filament; Act in-binding ; 
Calmodulin -binding; ATP -binding; Methyl at ion; Multigene family; 



r Proto- oncogene; 


Chromosomal 


translocation. 


DOMAIN 


1 


785 


MYOSIN HEAD -LIKE. 


DOMAIN 


786 


815 


IQ. 


DOMAIN 


844 


1934 


COILED COIL (POTENTIAL) . 


DOMAIN 


1935 


1972 


CARBOXYL- TERMINAL , 


NP_BIND 


178 


185 


ATP (POTENTIAL) . 


DOMAIN 


661 


683 


ACTIN -BINDING (BY SIMILARITY) . 


DOMAIN 


762 


776 


ACTIN-BINDING (BY SIMILARITY) . 


MOD_RES 


129 


129 


METHYLAT I ON (TRI-) (POTENTIAL) . 


CONFLICT 


887 


889 


EEK -> NSE (IN REF . 4) . 


CONFLICT 


1558 


1558 


T -> S (IN REF. 4) . 


CONFLICT 


1610 


1611 


KQ -> NE (IN REF. 4) . 


CONFLICT 


1786 


1786 


A -> S (IN REF. 5) . 


CONFLICT 


1958 


1958 


T -> L (IN REF. 4) . 


> SEQUENCE 


1972 


AA; 227338 


MW; 67665BB2AECE1277 CRC64 ; 


Query Match 




98.9%; 


Score 9602.5; DB 1; Length 19 



Best Local Similarity 99.6%; 
Matches 1929; Conservative 



Pred. No. 3e-2 79j 
0; Mismatches 



0; Indels 



7 ; Gaps 



1; 



Qy 

Db 

Qy 

Db 
Qy 

Db 



1 MAQKGQLSDDEKFLFVDKNFINS P VAQ ADWAAKRLVWVP S E KQG FE AAS I KEEKGDEVW 

Mini ; iiMiiiiiiiiii mill mum n mm minimi mini 

1 MAQKGQLSDDEKFLFVDKNFINSPVAQADWAAKRLVWVPSEKQGFEAASIKEEKGDEVW 



60 



60 



61 ELVENGKKVTVGKDDIQKMNPPKFSKVEDMAELTCLNEASVLHNLRERYFSGLIYTYSGL 120 

II Ihi 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 h 1 1 i 1 1 III IN I Ml il II 1 1 1 1 1 MM hi 1 1 

61 ELVENGKKVTVGKDDIQKMNPPKFSKVEDMAELTCLNEASVLHNLRERYFSGLIYTYSGL 120 

121 FCVWNPYKHLPIYSEKIVDMYKGKKRHEMPPHIYAIADTAYRSMLQDREDQSILCTGES 180 

INI II I II 1 1 1 1 i 1 1 1 M 1 1 1 1 1 1 M i 1 1 1 1 II 1 1 1 M I II h M 1 1 : 1 1 1 1 1 II II 1 1 

121 FCVWWPYKHLPIYSEKIVDMYKGKKRHEMPPHIYAIADTAYRSMLQDREDQSILCTGES 180 



Qy 181 GAGKTENTKKVIQYLAWASSHKGKKDTSITQGPSFAYGELEKQLLQANPILEAFGNAKT 240 

1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II II 1 1 II II II II 1 1 MM II 



Db 


181 


GAGKTENTKKVIQYLAWASSHKGKKDTSIT GELEKQLLQANPILEAFGNAKT 


233 


Qy 


241 


VKNDNSSRFGKFIRINFDVTGYIVGANIETYLLEKSRAIRQARDERTFHIFYYMIAGAKE 

MMMIMM II 1 Mill IMIIMM MIMMMMIMI MMMIMMMMM 

VKNDNSSRFGKFIRINFDVTGYIVGANIETYLLEKSRAIRQARDERTFHIFYYMIAGAKE 


300 


Db 


234 


293 


Qy 


301 


KMRSDLLLEGFNNYTFLSNGFVPIPAAQDDEMFQETVEAMAIMGFSEEEQLSILKVVSSV 

Ml II lllllll 1 Mill IIIIIMIII IIIMMI lllllll 1 MMMI IMIIMI 

KMRSDLLLEGFNNYTFLSNGFVPIPAAQDDEMFQETVEAMAIMGFSEEEQLSILKWSSV 


360 


Db 


294 


353 


Qy 


361 


LQLGN I VFKKERNTDQAS M PDNT AAQKVCHLMG I NVTD FTRS I LTPRI KVGRDWQKAQT 

1 II II II 1 II 1 1 1 1 Ml 1 1 1 1 1 li II II 1 1 1 1 II 1 1 1 M 1 1! 1 1 1 II I 1 1 1 1 : 1; l : l 1 1 


420 


Db 


354 


LQLGN I VFKKERNTDQASMPDNTAAQKVCHLMG I NWD FTRS I LTPRI KVGRDWQKAQT 


413 


Qy 

Db 


421 
414 


KEQADFAVEALAKAT YERLFRWI LTRVNKALDKTHRQGAS FLG I LD I AGFE I FEVNSFEQ 

III II llli;i 1 llllll II IIIIMIIIIIIIMI Mill 1 1 llllllll llllllll 1 

KEQADFAVEALAKAT YERLFRWILTRVNKALDKTHRQGAS FLG I LD I AGFE I FEVNSFEQ 


480 
473 


Qy 


481 


LCI NYTNE KLQQLFNHTMF I LEQEE YQREG I E WNF IDFGLDLQPCIELIER PNNP PG VLA 

IMMMMII 1 1 MMMI IM!llll MIMIII IMM M IMIMMMiM: III 

LCI NYTNE KLQQL FNHTM F I LEQEE YQREG I E WNF IDFGLDLQPCIELI E RPNNP PG VLA 


540 


Db 


474 


533 


Qy 


541 


LLDEECWFPKATDKSFVEKLCTEQGSHPKFQKPKQLKDKTEFS I IHYAGKVDYNASAWLT 

IMMMMII 1 1 IMMMIMMIM llllllll MIMM IMIM MM MMMI 

LLDEECWFPKATDKSFVEKLCTEQGSHPKFQKPKQLKDKTEFS I IHYAGKVDYNASAWLT 


600 


Db 


534 


593 


Qy 

Db 


601 
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KNMD PLNDNVTSLLNAS SDKFVADLWKDVDRI VGLDQMAKMTESSLPSAS KTKKGMFRTV 

III II II III 1 llllll llllllllllll MINI M MM lllllllll llllllll 

KNMDPLNDNVTS LLNAS SDKFVADLWKDVDRI VGLDQMAKMTESSLPS AS KTKKGMFRTV 


660 
653 


Qy 


661 


GQL YKEQLGKLMTT LRNTT PNF VRC 1 1 PNH E KRSGKLD AFLVLEQLRCNG VLEG I R I CRQ 

IMIMIMI 1 M IMMMIMMIM IIIMIMM 1 1 Mill IIMIMMMMM 

GQLYKEQLGKLMTTLRNTTPNFVRCI I PNH EKRSGKLDAFLVLEQLRCNG VLEG I RI CRQ 
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GFPNRIVFQEFRQRYEILAANAIPKGFMDGKQACILMIKALELDPNLYRIGQSKIFFRTG 

1 1 1 1 1 1 Mill 1 II II II II II III II MMMI 1 M 1 !M 1 1 1 M 1 II II II II II II 1 

G F PNR I VFQE FRQR YE I LAANA I PKG FMDGKQAC I LM I KALE LD PNL YRI GQSKIFFRTG 


780 

773 


Qy 
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VLAHLEEERDLKI TD VI MAFQAMCRGYLARKAFAKRQQQLTAMKVI QRNCAAYLKLRNWQ 

Ml 1 M 1 Ml 1 1 1 Ml M II 1 Ml III 1 II MM II 1 1 1 II M II II II iM Mill 1 1 1 


840 


Db 
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VLAHLEEERDLKITDVIMAFQAMCRGYLARKAFAKRQQQLTAMKVIQRNCAAYLKLRNWQ 


833 


Qy 
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WWRLFTKVKPLLQVTRQEEEMQAKEDELQKTKERQQKAENELKELEQKHSQLTEEKNLLQ 

II 1 1 1 1 MM 1 1 II 1 Ml 1 M III MM 1 Mill M MM 1 II II M IM 1 M M M II i 

WWRLFTKVKPLLQVTRQEEEMQAKEDELQKTKERQQKAENELKELEQKHSQLTEEKNLLQ 
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Db 
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Qy 
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EQLQAETELYAEAEEMRVRLAAKKQELEEILHEMEARLEEEEDRGQQLQAERKKMAQQML 

Ml II II MIIIIMIII II MM MM MIIIMI II Mill IIIMMI llllllll 

EQLQAETELYAEAEEMRVRLAAKKQELEEILHEMEARLEEEEDRGQQLQAERKKMAQQML 
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Qy 
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DLEEQLEEEEAARQKLQLEKVTAEAKIKKLEDEILVMDDQNNKLSKERKLLEERISDLTT 

: M 1 M 1 II II 1 1 I MM II II M III 1 II IM 1 1 1 1 1 1 II II II II II M 1 IMM M 1 

DLEEQLEEEEAARQKLQLEKVTAEAKIKKLEDEILVMDDQNNKLSKERKLLEERISDLTT 
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Db 
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Qy 
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NLAEEEEKAKNLTKLKNKHESMISELEVRLKKEEKSRQELEKLKRKLEGDASDFHEQIAD 


1080 


Db 
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II 1 1 1 M 1 M 1 II II M M M M 1 1 1 II 1 1 1 1 1 1 1 II 1 1 M II II 1 1 1! II M 1 M M M 

NLAEEEEKAKNLTKLKNKHESMISELEVRLKKEEKSRQELEKLKRKLEGDASDFHEQIAD 
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LQAQIAELKMQLAKKEEELQAALARLDDEIAQKNNALKKIRELEGHISDLQEDLDSERAA 

1 1 1 1 t 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 t 1 1 I 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 
1 II I 1 I I 1 I I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 f 1 M 1 1 1 

LQAQ I AELKMQLAKKE EELQAALARLDDE I AQKNNALKKI RELEGHI SDLQEDLDS ERAA 
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Db 
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Qy 
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RNKAEKQKRDLGEELEALKTELEDTLDSTATQQELRAKREQEVTVLKKALDEETRSHEAQ 

MM 1 Mhll IIIMIIillMIMIIIIIMI 1 Mill llllllll MM II II III 

RNKAEKQKRDLGEELEALKTELEDTLDSTATQQELRAKREQEVTVLKKALDEETRSHEAQ 
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Qy 


1201 


VQEMRQKHAQAVEELTEQLEQFKRAKANLDKNKQTLEKENADLAGELRVLGQAKQEVEHK 

1 1 1 1 1 IMM 1 1 II 1 Ml II II III M MM 1 IM 1 1 M M II 1! M II II M II II II 

VQEMRQKHAQAVEELTEQLEQFKRAKANLDKNKQTLEKENADLAGELRVLGQAKQEVEHK 
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Db 
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KKKLEAQVQELQSKCSDGERARAELNDKVHKLQNEVESVTGMLNEAEGKAIKLAKDVASL 


1320 



Db 


1254 
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Db 
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MllllimillllllllllMIIIMIMIIIIIIIMIIIIIimillMMIIII 

KKKLEAQVQELQSKCSDGERARAELNDKVHKLQNEVESVTGMLNEAEGKAIKLAKDVASL 
SSQLQDTQELLQEETRQKLNVSTKLRQLEEERNSLQDQLDEEMEAKQNLERHISTLNIQL 

lllllllllllll IIIIIUIIIIIIIIIIIIIIIIIIIIMIMIIMMIMIIMI 

SSQLQDTQELLQEETRQKLNVSTKLRQLEEERNSLQDQLDEEMEAKQNLERHISTLNIQL 
SDSKKKLQDFASTVEALEEGKKRFQKEIENLTQQYEEKAAAYDKLEKTKNRLQQELDDLV 

IIMIMIIIMIIillllllllllllllllllllllMMIIIIIIIIIIMIIIIIII 

SDSKKKLQDFASTVEALEEGKKRFQKE I ENLTQQYEEKAAAYDKLEKTKNRLQQELDDLV 
VDLDNQRQLVS NLE KKQRKFDQLLAEE KN I S S KYADE RDRAE AE ARE KETKAL S LARALE * 

1 1 li li I M 1 1 II N II IMI II M INI! 1 1 1 1 1 1 1 1 II li II 1 1 M II 1 1 1 1 1 1 1 1 1 1 

VDLDNQRQLVSNLEKKQRKFDQLLAEEKNI S SKYADERDRAEAEAREKETKALS LARALE 
EALEAKEELERTNKMLKAEMEDLVSSKDDVGKNVHELEKSKRALETQMEEMKTQLEELED 

M |l:| li; 1 1 II I! II II II II 1 1 II iM 1 1 1 1 M 1 1 II II II 1 1 1 1 II i 1 1 1 1 1 1 1 1 1 

EALEAKEELERTNKMLKAEMEDLVSSKDDVGKNVHELEKSKRALETQMEEMKTQLEELED 
ELQATEDAKLRLEVNMQALKGQFERDLQARDEQNEEKRRQLQRQLHEYETELEDERKQRA 

II Ml I ill Mil Nil II II I M 1 . 1 : 1 1 1 1 1 1 II HUM Illlllllllllllll II 

ELQATEDAKLRLEVNMQALKGQFERDLQARDEQNEEKRRQLQRQLHEYETELEDERKQRA 
LAAAAKKKL EGDLKD LE LQ ADS A I KGRE E A I KQLRKLQAQMKD FQRELED ARAS RD E I FA 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 II M M It 1 1 1 It 1 1 M 1 1 1 It 1 1 1 1 1 1 M 

LAAAAKKKLEGDLKDLELQADSAIKGREEAIKQLRKLQAQMKDFQRELEDARASRDE I FA 
T AKENE KKAKS LE ADLMQLQED LAAAE RARKQADLE KE E LAE E LAS S LSGRNALQDE KRR 

1 1 1 II 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

TAKENE KKAKS LE ADLMQLQEDLAAAE RARKQADLE KE E LAE E LAS S LSGRNALQD E KRR 
LEARIAQLEEELEEEQGNMEAMSDRVRKATQQAEQLSNELATERSTAQKNESARQQLERQ 

M 1 1 1 1 II I II II 1 1 1 1 1 1 1 i 1 1 II II II M 1 1 1 1 1 1 II M 1 1 1 1 II II II II II M II 1 1 

LEARIAQLEEELEEEQGNMEAMSDRVRKATQQAEQLSNELATERSTAQKNESARQQLERQ 

NKELRSKLHEMEGAVKSKFKSTIAALEAKIAQLEEQVEQEAREKQAATKSLKQKDKKLKE 

|| I I || II I M II I I M I I I I I I II II II I I M I I I II M M I I I II II I M II II I I II 
NKELRSKLHEMEGAVKSKFKSTIAALEAKIAQLEEQVEQEAREKQAATKSLKQKDKKLKE 

ILLQVEDERKMAEQYKEQAEKGNARVKQLKRQLEEAEEESQRINANRRKLQRELDEATES 

II 1 1 1 II II II II II II I II I M II II 1 1 1 1 1 1 1 M II I II M M 1 1 II II 1 1 1 M 1 1 II 

ILLQVEDERKMAEQYKEQAEKGNARVKQLKRQLEEAEEESQRINANRRKLQRELDEATES 

NE AMGRE VNALKS KLR 1936 
Illlllllllllllll 



